NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Improving Code Extraction from Coding Screencasts Using a Code-Aware Encoder-Decoder Model

https://doi.org/10.1109/ASE56229.2023.00184

Malkadi, Abdulkarim; Tayeb, Ahmad; Haiduc, Sonia (September 2023, 2023 38th IEEE/ACM International Conference on Automated Software Engineering (ASE))
Extended Abstract of A Comparative Study and Analysis of Developer Communications on Slack and Gitter

https://doi.org/10.1109/SANER56733.2023.00097

Parra, Esteban; Alahmad, Mohammad; Ellis, Ashley; Haiduc, Sonia (March 2023, 2023 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER))
A comparative study and analysis of developer communications on Slack and Gitter

https://doi.org/10.1007/s10664-021-10095-1

Parra, Esteban; Alahmadi, Mohammad; Ellis, Ashley; Haiduc, Sonia (March 2022, Empirical Software Engineering)

Full Text Available
UI Screens Identification and Extraction from Mobile Programming Screencasts

https://doi.org/10.1145/3387904.3389265

Alahmadi, Mohammad; Khormi, Abdulkarim; Haiduc, Sonia (July 2020, Proceedings of the 28th International Conference on Program Comprehension)
null (Ed.)
Mobile applications demand is on the rise, leading to more programmers learning to develop or having to maintain this kind of programs. Developers often refer to online resources to find inspiration or answers to questions they have about mobile programming topics and screencasts are a popular resource. However, given the multitude of screencasts available, it can be difficult to quickly comprehend which of the many videos is relevant to one's needs. We propose a novel approach, called UIScreens, which detects, extracts, and presents the most representative user interface (UI) screens embedded in mobile development screencasts. This could help developers quickly comprehend what an app displayed in a video is about, therefore saving time searching for useful videos. UIScreens has been evaluated in two empirical studies on iOS and Android programming screencasts. The first study investigates the accuracy of our UI extraction and shows that our approach is able to detect and extract UI screens with an accuracy of 94%. The second is a user study with mobile app developers, who evaluated both the accuracy and the usefulness of UIScreens. They agreed that UIScreens is accurate and extracts representative UI screens from videos. They considered that the extracted UI screens are useful for understanding what a video is about and if it is relevant to a search. Our approach has been implemented as a free online tool.
more » « less
Full Text Available
GitterCom: A Dataset of Open Source Developer Communications in Gitter

https://doi.org/10.1145/3379597.3387494

Parra, Esteban; Ellis, Ashley; Haiduc, Sonia (June 2020, Proceedings of the 17th International Conference on Mining Software Repositories)
null (Ed.)
Team communication is essential for the development of modern software systems. For distributed software development teams, such as those found in many open source projects, this communication usually takes place using electronic tools. Among these, modern chat platforms such as Gitter are becoming the de facto choice for many software projects due to their advanced features geared towards software development and effective team communication. Gitter channels contain numerous messages exchanged by developers regarding the state of the project, issues and features of the system, team logistics, etc. These messages can contain important information to researchers studying open source software systems, developers new to a particular project and trying to get familiar with the software, etc. Therefore, uncovering what developers are communicating about through Gitter is an essential first step towards successfully understanding and leveraging this information. We present a new dataset, called GitterCom, which aims to enable research in this direction and represents the largest manually labeled and curated dataset of Gitter developer messages. The dataset is comprised of 10,000 messages collected from 10 Gitter communities associated with the development of open source software. Each message was manually annotated and verified by two of the authors, capturing the purpose of the communication expressed by the message. While the dataset has not yet been used in any publication, we discuss how it can enable interesting research opportunities.
more » « less
Full Text Available
A Study on the Accuracy of OCR Engines for Source Code Transcription from Programming Screencasts

https://doi.org/10.1145/3379597.3387468

Malkadi, Abdulkarim; Alahmadi, Mohammad; Haiduc, Sonia (June 2020, Proceedings of the 17th International Conference on Mining Software Repositories)
null (Ed.)
Programming screencasts can be a rich source of documentation for developers. However, despite the availability of such videos, the information available in them, and especially the source code being displayed is not easy to find, search, or reuse by programmers. Recent work has identified this challenge and proposed solutions that identify and extract source code from video tutorials in order to make it readily available to developers or other tools. A crucial component in these approaches is the Optical Character Recognition (OCR) engine used to transcribe the source code shown on screen. Previous work has simply chosen one OCR engine, without consideration for its accuracy or that of other engines on source code recognition. In this paper, we present an empirical study on the accuracy of six OCR engines for the extraction of source code from screencasts and code images. Our results show that the transcription accuracy varies greatly from one OCR engine to another and that the most widely chosen OCR engine in previous studies is by far not the best choice. We also show how other factors, such as font type and size can impact the results of some of the engines. We conclude by offering guidelines for programming screencast creators on which fonts to use to enable a better OCR recognition of their source code, as well as advice on OCR choice for researchers aiming to analyze source code in screencasts.
more » « less
Full Text Available
Experiences Building an Answer Bot for Gitter

https://doi.org/10.1145/3387940.3391505

Romero, Ricardo; Parra, Esteban; Haiduc, Sonia (June 2020, Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops)
null (Ed.)
Software developers use modern chat platforms to communicate about the status of a project and to coordinate development and release efforts, among other things. Developers also use chat platforms to ask technical questions to other developers. While some questions are project-specific and require an experienced developer familiar with the system to answer, many questions are rather general and may have been already answered by other developers on platforms such as the Q&A site StackOverflow. In this paper, we present GitterAns, a bot that can automatically detect when a developer asks a technical question in a chat and leverages the information present in Q&A forums to provide the developer with possible answers to their question. The results of a preliminary study indicate promising results, with GitterAns achieving an accuracy of 0.78 in identifying technical questions.
more » « less
Full Text Available
GUI-focused overviews of mobile development videos

https://doi.org/10.1145/3377812.3390900

Alahmadi, Mohammad; Khormi, Abdulkarim; Haiduc, Sonia (June 2020, Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering: Companion Proceedings)
null (Ed.)
The need for mobile applications and mobile programming is increasing due to the continuous rise in the pervasiveness of mobile devices. Developers often refer to video programming tutorials to learn more about mobile programming topics. To find the right video to watch, developers typically skim over several videos, looking at their title, description, and video content in order to determine if they are relevant to their information needs. Unfortunately, the title and description do not always provide an accurate overview, and skimming over videos is time-consuming and can lead to missing important information. We propose a novel approach that locates and extracts the GUI screens showcased in a video tutorial, then selects and displays the most representative ones to provide a GUI-focused overview of the video. We believe this overview can be used by developers as an additional source of information for determining if a video contains the information they need. To evaluate our approach, we performed an empirical study on iOS and Android programming screencasts which investigates the accuracy of our automated GUI extraction. The results reveal that our approach can detect and extract GUI screens with an accuracy of 94%.
more » « less
Full Text Available
UIScreens: extracting user interface screens from mobile programming video tutorials

https://doi.org/10.1145/3368089.3417935

Alahmadi, Mohammad; Tayeb, Ahmad; Khormi, Abdulkarim; Parra, Esteban; Haiduc, Sonia (November 2020, Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering)
null (Ed.)
Mobile apps are one of the most widely used types of software systems in existence today and more programmers and students learn how to develop them everyday. One of the most popular resources for learning mobile programming are videos hosted on social platforms such as YouTube. While useful, this type of resource has also its limitations, especially when developers are looking for user interface (UI) designs for mobile applications, since these are hard to search for and locate in videos. We propose UIScreens, a web-based analysis and search engine that analyzes the visual contents of mobile programming video tutorials, then identifies and extracts the UI screens displayed in the videos. Our tool offers features such as searching for UI screens in videos, displaying an overview of the UI screens identified in a video under each search result, and navigating to the part of a video where a particular UI screen is being displayed and discussed. In a user study, participants agreed that UIScreens is usable and useful to quickly skim through videos, while the UI screens it extracts can help developers further determine the relevance of videos to a search topic.
more » « less
Full Text Available
On the relationship between bug reports and queries for text retrieval-based bug localization

https://doi.org/10.1007/s10664-020-09823-w

Mills, Chris; Parra, Esteban; Pantiuchina, Jevgenija; Bavota, Gabriele; Haiduc, Sonia (September 2020, Empirical Software Engineering)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records